$$\mathrm{dist\_d}(PC_{ab}, PC_{cd}) = \frac{1}{M} \sum_{j=1}^{M} \mathrm{dist\_d}\big(\mathrm{prob}(\mathrm{conf}(PC_{ab}))_j,\ \mathrm{prob}(\mathrm{conf}(PC_{cd}))_j\big)$$



Note: j refers to the components of the vector prob(conf(. . .)). Alternatively one can use 
the metric: 

$$\mathrm{dist\_d}(PC_{ab}, PC_{cd}) = \sum_{i=1}^{M} p_i (1 - p_i)$$

where the summation is over the attributes and p is the vector of probabilities for the
discrete attribute if we merge $\mathrm{conf}(PC_{ab})$ and $\mathrm{conf}(PC_{cd})$. In all cases distance values near
0 mean high similarity.
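
As an illustration only, the two discrete metrics described above might be sketched in Python as follows. The function names `dist_d_mean_abs` and `dist_d_merged` are hypothetical and do not appear in the specification. The sketch assumes that the first metric averages the per-attribute absolute differences over the M discrete attributes, and that the merged probability vector p for the second metric is supplied by the caller, since the text does not state how it is computed.

```python
from typing import Sequence


def dist_d_mean_abs(p_ab: Sequence[float], p_cd: Sequence[float]) -> float:
    """First metric (assumed form): mean over the M discrete attributes of
    |prob(conf(PC_ab))_j - prob(conf(PC_cd))_j|."""
    if len(p_ab) != len(p_cd) or len(p_ab) == 0:
        raise ValueError("probability vectors must be non-empty and of equal length")
    m = len(p_ab)  # M, the number of discrete attributes
    return sum(abs(a - b) for a, b in zip(p_ab, p_cd)) / m


def dist_d_merged(p: Sequence[float]) -> float:
    """Alternative metric: sum over attributes of p_i * (1 - p_i), where p is
    the probability vector of the (hypothetically) merged cluster."""
    return sum(p_i * (1.0 - p_i) for p_i in p)


if __name__ == "__main__":
    # Identical probability vectors give distance 0 (high similarity).
    print(dist_d_mean_abs([0.2, 0.9], [0.2, 0.9]))
    # A merged vector with no uncertainty also gives 0.
    print(dist_d_merged([1.0, 0.0]))
```

In both functions a value near 0 indicates high similarity, consistent with the statement above.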

Replace the paragraph beginning at page 18, line 1 and extending through page 18,
line 7 with the following paragraph:

Now one can compute continuous and discrete distances between super-clusters: 

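$$\mathrm{dist\_c}(V_i, V_j) = \frac{1}{|V_i|\,|V_j|} \sum_{PC_{ab} \in V_i} \sum_{PC_{cd} \in V_j} \mathrm{dist\_c}(PC_{ab}, PC_{cd})$$

$$\mathrm{dist\_d}(V_i, V_j) = \frac{1}{|V_i|\,|V_j|} \sum_{PC_{ab} \in V_i} \sum_{PC_{cd} \in V_j} \mathrm{dist\_d}(PC_{ab}, PC_{cd})$$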

There are two stopping criteria for the merging: 1) merging until we get a certain number
of clusters, K, given by the user; 2) merging until the distances are larger than a threshold,
maxMergeThreshold, supplied by the user.
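
As an illustration only, the two stopping criteria might be sketched in Python as follows. The function name `merge_super_clusters` and its parameters are hypothetical. The sketch assumes that the distance between two super-clusters is the plain average of the pairwise distances between their member clusters, and that the closest pair is merged at each step; any other super-cluster distance could be substituted for `group_dist`.

```python
from itertools import combinations
from typing import Callable, List, Optional, Sequence, TypeVar

T = TypeVar("T")


def merge_super_clusters(
    clusters: Sequence[T],
    dist: Callable[[T, T], float],
    k: Optional[int] = None,
    max_merge_threshold: Optional[float] = None,
) -> List[List[T]]:
    """Greedily merge super-clusters until a stopping criterion is met."""
    groups: List[List[T]] = [[c] for c in clusters]  # one super-cluster per cluster

    def group_dist(g1: List[T], g2: List[T]) -> float:
        # Assumed form: average pairwise distance between member clusters.
        return sum(dist(a, b) for a in g1 for b in g2) / (len(g1) * len(g2))

    while len(groups) > 1:
        # Criterion 1: stop once the user-given number of clusters K is reached.
        if k is not None and len(groups) <= k:
            break
        # Find the closest pair of super-clusters (ties broken by index order).
        best_d, i, j = min(
            (group_dist(groups[a], groups[b]), a, b)
            for a, b in combinations(range(len(groups)), 2)
        )
        # Criterion 2: stop once the closest pair is farther than the threshold.
        if max_merge_threshold is not None and best_d > max_merge_threshold:
            break
        groups[i] = groups[i] + groups[j]
        del groups[j]  # j > i, so index i is unaffected
    return groups


if __name__ == "__main__":
    points = [0, 1, 10, 11]
    print(merge_super_clusters(points, lambda a, b: abs(a - b), k=2))
```

Either criterion may be used alone; when both are given, the sketch stops at whichever is satisfied first.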

Remarks 

The changes to the text are made to make the notations with respect to distance 
metrics consistent with each other. An attachment to this amendment contains the two 
replacement paragraphs with additions underlined and deletions bracketed. 

Respectfully Submitted, 




Stephen J. Schultz 
Reg No. 29,108 
Changes shown with underlining and bracketing

Paragraph beginning at page 17, line 12.

Here, μ is the mean of the cluster with index ab.
The distance between two discrete cluster attributes is:

$$\mathrm{dist\_d}(\mathrm{conf}(PC_{ab}), \mathrm{conf}(PC_{cd})) = \left|\,\mathrm{prob}(\mathrm{conf}(PC_{ab})) - \mathrm{prob}(\mathrm{conf}(PC_{cd}))\,\right|$$

One then must take into account all the M attributes that are discrete:

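$$\mathrm{dist\_d}(PC_{ab}, PC_{cd}) = \frac{1}{M} \sum_{j=1}^{M} \mathrm{dist\_d}\big(\mathrm{prob}(\mathrm{conf}(PC_{ab}))_j,\ \mathrm{prob}(\mathrm{conf}(PC_{cd}))_j\big)$$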

Note: j refers to the components of the vector prob(conf(. . .)). Alternatively one can use 
the metric: 

$$\mathrm{dist\_d}\big([\mathrm{prob}(\mathrm{conf}(PC_{ab})),\ \mathrm{prob}(\mathrm{conf}(PC_{cd}))]\ \underline{PC_{ab},\ PC_{cd}}\big) = \sum_{i=1}^{M} p_i (1 - p_i)$$

where the summation is over the attributes and p is the vector of probabilities for the
discrete attribute if we merge $\mathrm{conf}(PC_{ab})$ and $\mathrm{conf}(PC_{cd})$. In all cases distance values near
0 mean high similarity.

Paragraph beginning page 18, line 1 and extending through page 18, line 7. 

Now one can compute continuous and discrete distances between super-clusters: 

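$$\mathrm{dist\_c}(V_i, V_j) = \frac{1}{|V_i|\,|V_j|} \sum_{PC_{ab} \in V_i} \sum_{PC_{cd} \in V_j} \mathrm{dist\_c}(PC_{ab}, PC_{cd})$$

$$\mathrm{dist\_d}(V_i, V_j) = \frac{1}{|V_i|\,|V_j|} \sum_{PC_{ab} \in V_i} \sum_{PC_{cd} \in V_j} \mathrm{dist\_d}(PC_{ab}, PC_{cd})$$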

There are two stopping criteria for the merging: 1) merging until we get a certain number
of clusters, K, given by the user; 2) merging until the distances are larger than a threshold,
maxMergeThreshold, supplied by the user.


